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ABSTRACT 


An investigation is made into the feasibility of using 
mixing-ratio predictions employing multiple linear regres- 
Sion techniques applied to saturated mixing ratio profiles 
at specified levels. Three such sets of coefficients are 
determined from Ocean Station Vessel sounding data recorded 
for the months of January (1967-70, inclusive), and tested 
for effectiveness to determine a "best" prediction scheme. 
The resultant "best" set of coefficients is then compared to 
a previously designed regression-prediction scheme due to 
Weinreb and Crosby (1973) based on a Selection of island and 
coastal data divided equally in time and space. TestsS were 
performed to determine possible sources of error affecting 
the water vapor svecifications made by the multiple regres- 


Sion procedure. 
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I. INTRODUCTION 


Weinreb and Crosby (1973), (henceforth abbreviated WC), 
presented a useful technique for estimating atmospheric 
moisture profiles to assist in obtaining more accurate tem- 
perature retrievals from satellite soundings. Their estima- 
tion procedure was designed to derive the mixing ratios W at 
the lowest 40 levels of the 100-level atmosphere (McMillin, 
et al, 1973) defined in terms of linear j-increments of p2/7 


given by 


a = .01(1.+(.26087836)(j-1)?/? Ci) 


The saturated mixing ratio values Wok) at 11 key levels 
are chosen as predictors from among the iowest 40 j-ievels 
Seer (i). meine), ase = O1,...,100, ae ranges over 
Zolee.., O00 mb 1n the manner Shown in Table 1. Table 1 
also indicates the 11 fixed k-levels designated for computa- 
tional use of the W(k)-values employed as predictors. The 
data set used by WC to establish the regression coefficients 
was deduced from 1100 island and coastal radiosondes with 
humidities reported to 200 mb. The WC radiosonde sample was 
adistribubedsaim an approximately random manner with respect 
to season and latitude. 


The forty regression equations of WC were presented in 


ae LOnm 
n _ H _ 
eee -\u . i= . 2 
We = Wot 2 Ci CW CK )-W Ce) ) 3 JH... - 40 (2) 
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Table l. 


Level 
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Oona UNS Wh 


The 100 level atmosphere. 
j=l. « LOO 


Pressure 
ap 
(mb ) 


0100 
HOZ25 
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ak sia) 
72703 
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mls eke, 
»4322 
. 7746 
7 eo 
6288 
voy 
wa 20 
-4100 
» 1584 
go 1) 
- 9206 
seeleil 
-0758 
Oe ose 
Pio 3 
LS eloy 
14.7804 
16.5049 
Ten 7 20 
PAO) St IA oy 8: 
220209 
24.9765 
27.5024 


On DONNNE WWN DN FEF 


Level 


: 


Each j is an increment of pas 


Asterisked values are used for predictor 
levels in the regression analyses conducted.. 


Pressure 
P(j) 
(mb ) 


1) 


5 Uy 
70936 
ellie ys) 
SOL 
Ss eh 
- 6407 
5643 
Pere si72 
1104 
- 7487 
~6414 
5 SNe er 
AS. 
9217 
oon 
5 OZ 
8027 
7, 
mI2O9 
-4991 
wou 
.6484 
2057 
eo U2 
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~4501 
9868 
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eG 
. L209 
26 
.0774 


Level 


Pressure 
Ry 
(mb ) 


27 
284. 
299% 
Bale 
328. 
344. 
360. 
Biedas 
394. 
Al2. 
430. 
449. 
469. 
489. 
509. 
Sic dbe 
953. 
Dae 
Door 
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647. 
673. 
G92. 
23. 
753. 
Hae 
810. 
DaMe 
870. 
901. 
93% 
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1000 


2453 
8862 
0103 
6275 
7482 
3825 
5408 
236 
eZ 
2642 
6252 
5589 
0820 
20353 
9357 
2841 
2655 
8889 
1655 
1065 
i 2a 
0266 
0285 
7401 
1729 
3305 
2485 
Olay 
3486 
5623 
5674 
3760 
0000 








Equation (2) gives the format of the prediction equation 
where j3 is the level Cima miang ratio prediction: kK 1S 
the predictor level and W,, W_(k) the respective sample-means 
ope e W(K) from the 1100 soundings. As seen in Table 1, 
the arrangement of the predictor levels (denoted by aster- 
isks) takes primary advantage of the specification of the 
W.-profile in the lower levels, yet allows for additional 
upper level influences. 

The estimation method presented by WC demonstrated a 
strong multiple correlation between the predicted W, values 
and the indicated saturated mixing ratio as predictands. 
However, in the collection of 1100 sample soundings, strati- 
fication of the regression results of WC by season was not 
consivdemed. Additionally, in applicaticn tc the compuwbe tmien 
of precipitable water vapor for radiometric (VTPR) calcula- 
tions, island and coastal radiosondes were considered as 
being representative of open ocean areas. 

In this thesis several multiple regression estimation 
formulas for water vapor porfiles are Bnei based upon 
the ue and re Cs) data samples from only ten Ocean Station 
Vessels (OSV) distributed over the Atlantic and Pacific. 
However, the OSV data were based only on soundings from the 
month of January in the years 1967 through 1970, and so 
should possibly be considered representative of midwinter 
moisture predictions over the oceans. The regression formu- 
las derived were based upon 1081 dependent data samples and 


then applied for independent test purposes upon a smaller 
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data sample drawn from the same set of OSV-soundings. Eval- 
uation of the various estimation methods derived here was 
meme tved im Order to arrive at a "best" set of mixing—ratio 
specification equations. This prediction Leena was then 
compared with the WC prediction equations for the purpose of 
revealing the relative specification significance in the 
equations derived here. This evaluation was conducted using 
the smaller independent OSV data sample. Finally, a data- 
sample developed from 160 island and coastal soundings, col- 
lected from 1-28 March 1973 and similar to those used by WC 
(1973), was subjected to the "best" specification system de- 
veloped here and compared with independent testing based on 
the use of the WC specification system. 

Pmeowsiacrable amcunt cf linear interpolation is required 
to adjust sounding data to the 100-level atmosphere, specif- 
ically to the levels 61 through 100. In this thesis and in 
the work done by WC simple linear interpolation on pressure 


raised to the ge 


power as listed in Table 1 is used. It 
is reasonable to assume that the regression coefficients de- 
termined by WC are not independent of one another over the 
40-level system, and so could be reduced to a reasonably 
Conemsen but smaller predictand set, say J=1,...,19. For 
purposes of this thesis, the best-fit multiple regression 
equation in the form of (2) was therefore applied only av 
GaemEonleveds dicted in Table 2. It is to be noted that 
this set of 19 levels is a subset of the 40-level predictand 


set of WC and that it also includes the 11 levels used as 





Table 2. 19-level atmosphere. The pressure levels P(J) are 
obtained by linear scaling of P?/? over the interval 
188.45 mb to 1000 mb in 19 increments. Column 3 


indicates the subset of predictor levels. 


Predictand W_(k) Predictor 
Level Pressure Level Levels 

(J) Ea) (my) (kK) 
dl 188.45 

2 209.94 1 
S ZA aot 

4 PATO OE! 

) PESIS) (O)L 5 
6 C055 (LS 

(4 360.54 

8 412.26 4 
9 469.08 

10 DO9rdsS 5 
Lal SDS) 4 VAS. 

ap2 S797 6 

IL: 647.72 

14 699.03 6 
ie OS AT 7 
16 SO 2 oO 8 
Ny 870730 9 
18 3a OO se) 
19 1000700 alle 


nc ee 
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peedtetor-levels in their system. It was assumed that re- 
stricting Eq. (2) to 19-predictand levels would result in 
little, if any, loss of resolution of the estimated mixing 
ratio profiles over the lowest 40 WC predictand j-levels. 
Hence, comparison of the regression results deduced here 
will be with those results of WC at the 19-predictand levels 
adopted for use in this study. 

Regression analysis was carried out on the IBM 360 com- 


puter using the BMDO3R statistical program (Dixon, 1973). 


Py 





II. DATA PROCESSING 


A. OCEAN STATION VESSEL RADIOSONDES 

The raw data used in this study consisted of a set of 
January radiosondes containing temperature and relative hu-- 
midity data in accordance with standard radiosonde reporting 
procedures, i.e. available mandatory level plus any signifi- 
cant levels (Radiosonde Observations Handbook, 1969), from 
the 10 OSVs. Table 3 gives the approximate latitude and 
longitude for each of the OSVs in addition to the sample- 
Size contributed by each to the total data set. Also listed 
in Table 3 is a Similar breakdown of the sample size contri- 
butions to the total dependent and independent data samples 
(abbreviated as OSV-1, and OSV-2, respectively). The latter 
two data samples will be discussed later in this section. 

Each sounding included in the OSV data samples was re- 
quired to have temperature data up to at least 150 mb and 
relative humidity data to 450 mb. Both mandatory and sig- 
nificant levels were combined to produce temperature and 
relative humidity profiles at pressure levels arranged in 


order from 1000 mb to 150 mb. 


B. REDUCTION OF RADIOSONDE OBSERVATIONS 
Once a radiosonde was arranged in a pressure-decreasing 


order the Goff-Gratch formula in the form (List, 1963) 
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Table 3. Radiosonde-data sample sizes from Ocean Station Vessels, and 


their geographic locations. 








POSITION NUMBER OF SOUNDINGS IN SAMPLES 
OSV LATITUDE LONGITUDE TOTAL DEPENDENT DATA INDEPENDENT DATA 
OSV (OSV-1) (OSV-2) 

A 62.5N 33.0W 194 97 97 
B 56.5N 50.5W 154 77 77 
é 53.5N 35.5W 189 95 94 
it 59.0N 19.0W rae 106 106 
J 53.5N 19.5W 206 103 103 
P 50.0N 145.0W 202 101 101 
D 44.5N 41.0W 169 113 56 
E 35.0N 48.5W 185 123 62 
N 307 ON 140. 0W 205 136 69 
V 38.0N 165.0W 194 Hs 0 64 
TOTALS 1910 1081 829 





log(e./Po) = ~7.90298((T_/T)-1)+5. 02808 log(T_/T) 


-a34(1-T/T) 


Siessice< Or (10° ~ =) 


Weyi328 x 10-2710 °° 72 tot(Tg/7)-1)_4 (3) 


where Ai = 373.16°K, Py» = 1013.246 mb, and T = T(P) is the 
temperature at pressure P, was used to determine saturated 
vapor pressures We: From the value of en» WoCP) was deter- 
mined from 


HO eC 
Ss 


Wf?) = Pe a) 


which in turn yielded the actual mixing ratio W(P) upon appli- 


cation of the relationship between RH (as a fraction) and We 
W(P) = RH x Wo(P) Esp) 


Following this procedure, simple linear interpolation of 
both T(P) and W(P) relative to the [P(J)]?/’ values of Table 
2 was performed, yielding a temperature profile T(J) up to 
188.45 mb and a mixing ratio profile W(J) to 469.08 mb. 
Equations (3) and (4) were again needed to establish the 
11-predictor values of Wok); in this case T = T(k), P = P(k), 
k=...) 2., as Shown in Table 2. 

Seireceemex<ine ratio (or RH) profiles were usually unavaizl- 
able in the radiosondes above 450 mb, the well known cubic 


power-law profile of Smith (1966) expressing W in the form 


2 2 6 
ie Wace] P h #3 (6) 


r 
ee ‘ 
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was used. Here ct and a signify the simultaneous con- 
ditions at the reference level, taken here as py = 469.08 mb. 
Equation (6) was employed to compute "observational" values 
of W at all levels above eee to complete the 19 levels of 
Habie 2. 

Completion of the above procedures through (6) yielded, 
fiom each sounding, an array of 19 values of W and i1 values 
ow We: These values were arranged ina final 30-variable 
format [WC1),WC2),...,W(19),Wi(1),Wi(2),...,W C11) ). Abia 
format was found to be most convenient accessing the least 


squares statistical program, BMDOSR. 


C. DIVISION OF DATA SAMPLES 

The collection of 1910 OSV soundings was divided into 
two major data samples, OSV-1 consisting of 1081 total 
soundings, and OSV-2, comprised of the remaining 829. OSV-2 
data were obtained by selecting a random mix of every second 
to every third sounding from the total set of soundings 
available from each ship, with the remainder going into the 
dependent sample OSV-1. Table 3 lists the contributions to 
each sample on a ship by Ship basis. OSV-1 was used for de- 
termination of the dependent-test set of regression equa- 
tions, Lor MCI) specified in terms of the predictor~-variabies 
WoC1),-.-WiC11). OSV-2 was used for independent or verifi- 
cation tests of regression results developed using OSV-1. 

An additional subdivision of the OSV data was made to 
determme latitudinal effects, if any, on “the moisture esti-— 


mation regressions. This subdivision was suggested after 


Pall 


re 





Pometaeration of the distributions of the mean mixing ratios 
W(J) for each ship station in the OSV-1 data sample. 

Table 4 lists the mean W-values by level for the entire 
dependent sample set, the subdivisions of the sample accord- 
ing to $>S0N, and $<S0N, as well as by individual ships. It 
seemed plausible to use 50N as the dividing point for the 
latitudinal zones as this arrangement provided reasonably 
large sample sizes to stratify the regression analyses in 
addition to maintaining the apparent stratifications demon- 
strated by the mean distributions. Although Ship Station P 
was located at SON, its W-profile statistics seemed to be 
described more closely by the means of the northern set as 
its verification statistics suggest (Sec. IV.B.1). 

Verification of the estimation equations derived by the 
latitudinal stratification of the dependent sample set was 
PomMeNectoCmone ther stratit@deations (a) all 6, (b) ¢250N, (Ce) 
o<50N as well as for the individual ship-data samples re- 
served for the independent OSV-2 sample. The results of the 


tests on the individual ships are tabulated in Appendix B. 


D. ISLAND AND COASTAL RADIOSONDE SAMPLE 

Data collected under the auspices of the NATIONAL ENVI- 
RONMENTAL SATELLITE SERVICE (NESS) during March, 1973, were 
used for additional testing of the estimation procedures. 
Tmilemaata set provided a total of 160 typical radiosondes 
eontaining moilsmure data from the surface up to at least Llev— 


el J=9 of the pressure scale of Table 2 but with temperatures 
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Mean W-profiles for OSV-1 data, gm/100kg. 


Table 4 (a). 
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Standard deviations of the W-profiles corresponding to part a. 
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ipeto at least level 1. The format of the soundings was 
much the same as that of the OSV data except that the mois- 
ture element was given in terms of the dew point Ty rather 
than RH. This difference required the use of Eq. (3) with 
en and T replaced by e and Ty: respectively, followed by 
application of Eq. (4) to obtain the W-values. All other 
procedures in this section apply to the NESS data to obtain 
soundings in the form [WC1L),W(2),...,WCS), WiLL), Wo(2),... 
Wo(11)]), where J is at least as high as level 9. Recall that 
WoC1),WoC2),.-., Wo C11) depend simply upon having the temper- 
ature profile complete in the form of Table 2 to level 

P(2) = 209.94 mb, and that for purposes of computing Wok), 
the indexing k=1,...,11 denotes the indicated subset of the 


J-values shown in column 3 of Table 2. 


E. VERTICAL WEIGHTING 
With the mixing ratios estimated at most of the 19 levels 
of Table 2, it was possible to compute precipitable water 
vapor for each sounding as a measure of the vertically inte- 
grated mixing ratios. The precipitable water vapor, calcu- 
lated by means of the finite-difference integration, Eq. (7). 
JISSCSSS, 
CW 


z 
_ 1000 J=1is_°"* 
980 D 


vi) C= p 


ene J? 


(7) 
provided a basis for testing “estimated" precipitable water 
vapor against the corresponding "observed" U-values deduced 
from the W-profiles summed over the same atmospheric levels 


from level J downward. In the case of OSV soundings the 
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summation of Eq. (7) extended up to J=1; however in the case 
of the NESS data, the top level of humidity reports extended 


oeromietevel J in the range 1<J<9., 
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Ili. STATISTICAL CONCEPTS AND PROCEDURES 


A. MULTIPLE LINEAR REGRESSION 

This thesis makes use of multiple linear regression 
(MLR) techniques to develop an effective specification 
scheme to estimate mixing ratio values at the 19 previously 
designated pressure levels, using as predictors, the satu- 
rated mixing ratios obtained at the 11 key levels specified 
by WC. In the discussion to follow, the phraseology "de- 
pendent variable" refers to observed mixing ratios to be 
specified in terms of the independent variables when MLR 
techniques are to be applied. The independent variables are 
always the set of saturated mixing ratios computed at the 11 
specified levels in the vertical soundings. 

Assuming that the dependent variable Y depends on k in- 


dependent variables (X, ,X -»X,), and that there 1S a sam- 


ae 
ple of N observations of the kt1l variables (Y,X,,X%5,---5%,), 
MLR techniques can be ayy fee tomtinds the best-fit mw wpe 
regression coefficients oa: Y can be estimated by the re- 
gression equation 


Y=At+. 
3 


eas le se 


Cae. (8) 
1s ae 


where the superscripted symbol Y denotes “estimation of Y.~ 
Also it follows, from the requirement that the regres- 


Sion plane pass through the sample mean, that 
A= eee ON (9) 
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The quantities ( ) signify the respective means based on the 
sample size N. 
It should be noted that Eqs. 8 and 9 can be combined to 


produce the form 


ie a C ,(X,-K,) (10) 


where the oa are the best-fit regression coefficients de- 
termined by MLR techniques. 
The regression coefficients of (10) are determined by 


solving the system of norinal equations given in determinant 





form by 
hey <7 hie tk Dy. 
a a a 
21 202 2k = 2V aay 
ee »C) Z ~ | ’ {it ) 
Ay tte Bie Ane 
where a.. and a. are given by 
aby 1y 
I 
= =< ee 12 
455 Me an XSi ee ve 
and 
N a _ B55 
ae) ni Ba $ 
Bay = men Sim % iy) Om *) 
respectively. 


m 
In Eqs. (12) and (13) it is understood that rs y 


(m) 
Xe 


fication the superscript in this sample designation has been 


) constitute the mth sample of Y on X.. For simpli- 
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Vue aS A Subseript in (12) and (13). From Bq. (12), 

255 = 85a> and therefore, the matrix of fa; 3 in (11) is syn- 
metric. Equation (11) represents a system of k equations in 
the k unknowns C. which can be solved for a unique set of 
regression coefficients, provided the k-by-k determinant of 
the left hand side of (11) is non-singular. If the multipli- 


ecamleneoL Che left hand terms of (11) is carried out, one 


can obtain k equations in the form 


aif, ote arf, Trees oy ot A140), = ae (14) 


Miitewmcan be SOlved (Crow, et al., 1955) by matrix inversion 


methods applied to {a;5}. 


B. SIGNIFICANCE TESTS 
1. On Dependent Regression Equations 
The coefficient of determination of an MLR technique 
is definable as the ratio of the sum of squares due to the 
regression to the total sum of squares. This parameter may 
be shown to be the square of the multiple correlation coeffi- 


. 2 
cient, Rae 


N 
(Naa )e 
R2 = Sum Popup lal nedeny nes ase Seeker a3 (15) 
M Sum Sq. of Total Deviation Neen apsee 
We Coe) 


and is used as a measure of the efficieney of the regression 


performed by the BMDO3R program. The total sum of squares 


N N 
SOME ae = Seng aye =Y > =n 1)S- 16 
eoeetot ee = 2 (ea il (N-1)S, (16) 
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defines the variance Sy. S.S. total may be expanded 


N N N 
= y _v 2 : es _v 2 y _ es Z 
omce coralg=. 2 4v.-Y) fey OL On Cee (17) 


(a) + (Db) 


which is a combination of the S.S. due to regression (a), 
Bettie eresidual S.S. (bp). 

If the regression were perfect, term (b) of (17) 
would vanish and Ry would be 1; conversely, if Y were totally 


independent of X,, a would be 0. Combination of Eqs. (15) 
and (17) shows that the residual sum of squares after appli- 


cation of the regression (8) is 


N 
= —~ R2 y =y }- 
Res. 8.58. (1 Ry? 92103 x) 
A method of determining the significance of a predic- 
mon equation havang k predictors is the F-statistic with k 


and N-k-1 degrees of freedom, defined by 


ae se iican, oduare Explained by the Prediction 
Era). Mean Square Unexpl. after the Prediction ’ (18) 
ow 
R,,/k 
a) —C (19) 


(1-R,,)/(N-k-1) 


Holton oa1s a direct application of the deiini— 
miGmeGkonmot the F-statiseic fom the appropriate degrees sor 
freedom. For the numerator of (19), k is the number of in- 
dependent variables used in specifying the mean square ex- 


mieimed of (18). ‘In the denominator of (19) the degrees oF 
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freedom (df) is equal to N-k-1, corresponding to the total 
sample size, reduced by the 11 Known coefficients and the 
value of the sample mean. In the tests on the dependent 
sample k=11 throughout, although N varies from test-sample 
to sample. 

Critical values of the F-statistic, designated Bo 
can be found in standard texts on statistics for the appro- 
priate degrees of freedom at specified levels of signifi- 
cance. If the F-statistic of (18) exceeds F, 9;,(k,N-k-1), 
there is less than 1% probability that the sample came from 
a population in which there is no linear relationship be- 
tween Y and X,. Such a statistical outcome for the regres- 
sion is described as corresponding to a 99% level of 
confidence. 

A third related statistic which is listed as output 
by BMDO3R is the standard error of estimate (after applica- 
tion of the regression) is defined by 


vy \2 
S.S. Errors _ ae 


2 
(STD. ERR.) = “(Ty =~ “(N-ESD) 


(20) 


Equation (20) shows that the standard error is simply the 
Square root of the mean of the errors squared. The standard 


error is related to a by 


(N-1)S85(1-R,,) 


(N-k-1) 2) 


(STD. ERR. )* = 
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Piecmenbecemoctacistico: defined dmgGls), (18), andec2o) 
were considered as comparative measures of the MLR techniques 
of this work and were conveniently provided as a standard 
output from the BMDOSR program. Values of Rap SVD aes 
and F are presented in tabular form in Section IV, based 
upon MLR techniques applied to relevant subdivisions of both 
dependent and independent samples. 

2. On Independent Data for Level by Level Analysis 

Similar test procedures were applied to evaluate es- 
timates of the regression methods from Sec. JII.A.1 to the 
independent OSV-2 sample. Slight variations in the degrees 
of freedom were required in Specifying the efficiency sta- 
tistics defined by Eqs. (15), (18), and (20) to the inde- 
pendent sampies. In the regression analyses applied to the 
dependent sample, 11 predictors and the mean were used in 
arriving at N-12 degrees of freedom in the denominator. 
However once the coefficients were applied to the OSV-2 sam- 
ple, Suiemerwaseessentially only one way to form the estama-— 
ELOns Y, using the prescribed set of coefficients. Therefore, 
in this section the df corresponding to k was taken as l. 
R* was also somewhat modified and was termed simply ReRF? 


M 


defined as the effective coefficient of determination. The 


Z 
ROPER values were defined by 
Seo. Ce OorS 
2 = OO a ae Se ee Z. Z 
ROFF = a Seo. LOtat eee 
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Bie F=statistic°™and the standard error of estimate 
wetme Calculated in a manner similar to Eqs. (19) and (20) 
wav k-1, but with N-k-1 replaced by N-1. The resulting ex- 


pression for these statistics 


R2 


oe : Jaleo (23) 
and 
z(¥,-¥,)? 


were used when conducting significance tests on the inde- 
pendent data set. 

The statistics were collected and tabulated at each 
of the 19 prediction levels. Comparison of their values 
from the various estimation methods provided a means of de- 
termining whether a latitudinally dependent specification 
scheme over the OSV sounding samples was more CCT than 
a composite-latitude specification scheme. After determina- 
tion of a "best" scheme, specification usefulness was com- 
pared in the same manner with the WC prediction scheme. 
(Here WC denotes the Weinreb-Crosby regression system. ) 

3. Independent Data by Precipitable Water Vapor 

Precipitable water vapor values, both actual and 
predicted, calculated by Eq. (7) were subjected to a simple 
linear regression of U on U by BMDO3R, in order to obtain a 
Vertically weighted comparison of the various specifications. 


The three statistics described in Section III.B.1 were used 
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to judge the effectiveness of the estimates by the various 


methods. 
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IV. STATISTICAL RESULTS 


A. DEPENDENT SAMPLE 

This phase of the research consisted of collecting, 
tabulating, and evaluating the results of three MLR tests 
carried out by BMDO3R on the dependent OSV-1 sample of sound- 
ings. The first MLR provided the 12 coefficients and the 
statistics for each of the 19 prediction equations to be 
applied over the composite sample of ship-soundings. The 
second and third MLR's provided similar sets of coefficients 
and statistics for the latitudinal divisions of the soundings 
POgeooONeand -90N, respectively. These three sets of re- 
gression equations, employed in the form of Eq. (8), will be 
referred to as: 

1) COEF-1 for the one over-all set, regardless of 90, 

2) COEF-2N,S using 

a) The coefficient set COEF-2N when $>50N, and 
b) COEF-2S when $<50N, 

3) The composite set of COEF-2 without the N or 8 

designation. 

The latter set implies a composite estimation by one or 
other of the two components of the applicable equations in 
the appropriate latitudinal zones. The sample size is the 
same as for COEF-1. 

Table 5 lists the values of Rag the SlLandeard error 1:6 
estimate, and F as obtained from BMDO3R for each MLR experi- 


ment at each of the 19 levels listed in Table 2. The most 
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sulimcent Critical F-value, PO (k,N-k-1), for the entire dis- 
play at the 1% significance level occurs for the degrees of 


freedom indicated by 


F (11,490) Sean BS. 


which occurs in the southern subdivision of OSV-1. As can 
be seen from Table 5, all F-values exceed this critical val- 
ue by a rather impressive amount. The inference was drawn 
that each estimation procedure so determined was based ona 
strong linear dependence of W on Wok). Additionally, com- 
parison of the standard errors of Table 5 with the corre- 
sponding OSV-1 sample standard deviations (listed in Table 
Apenditeavtes a Saegnificant reduction of the variances by the 
regression estimates. 

An example of a regression equation at level J=19 (cor- 


responding to 1000 mb) is given by 
W(19) = 2.1082+0.4979 W ,(1)-0. 5900 W (2)-1.1938 W (3) 

-0.1600 W (4) +0. 2194 W(5)-0. 2095 W(6)-0.1151 Wo) 

OZ oles W , (8) +0. 0815 W (9) +0. 9076 W ,(10)-0.0313 W (11) C25) 
Equation (25) is one prediction equation of the set of 19 
equations developed from COEF-1. A listing of the three co- 
efficient sets comprising COEF-1, COEF-2(N,S) as well as the 
WC coefficient set is contained in Appendix A. It should be 


lepedutiat the L1rst term on the right side of (25) results 


from Eq. (9) with all the sample means known. 
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Examination of Table 5 does not reveal convincing evi- 
dence with which to determine a "'best" prediction method be- 
tween COEF-1 and COEF-2 as all F-values demonstrate at least 
a 1% significance level for dependent W-specifications. For 
purposes of this study, a highly significant improvement in 
specifications by COEF-2 over COEF-1 would be required. This 
requirement was deemed essential to override the added com- 
puter storage and procedures required to implement a double 
set of coefficients. Clearly, Table 5 does not offer this 


evidence. 


Bee INDEPENDENT SHIP SAMPLE 


1. An Illustration of the Usefulness of Latitudinally- 
Stratified Coefficient-—-Set 


Recall that in Section II, the question of a latitu- 
Gamal stratifieation of the water vapor prediction equations 
was raised. The reason for considering this approach is 
well illustrated by the level-by-level results for Ship P 
presented in Table 6 (based here upon independent test data). 
The results of Table 6, e.g. larger values of RoPR resulting 
from the COEF-2N regression scheme, indicated stronger sig- 
nificance level-by-level than those of either COEF-1 or 
COEF-2S coefficient sets when applied to the same data. 
These results, as much as any individual ship results, indi- 
cated the need to consider, initially, a latitudinally- 
stratified set of predictors, with the boundary set at 9>90N. 
inmmaicdcmeron, ©OHE=) regressions consider the alternate hy— 


pothesis, namely, the case of no latitudinal Strait) ) wea Glomer 
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Note that like results for the remaining nine ships 
of the OSV-2 sample are listed in Appendix B in the order 
Spectiied by Table 3. 

2. For the Determination of a "Best" Set of Coefficients 

a. Level-by-Level 

Water vapor estimates were made by COEF-1 and 
COEF-2N,S for the entire independent OSV-2 sample. The re- 
sults are to be tested level-by-level as follows: 
i. COEF-1 vs. COEF-2N for latitudes $>50N 
di. COEF-1 vs. COEF-2S for latitudes 9<S0ON 
iit CORP=i ys. COKF-2 for all ¢ 
The statistical results for the three comparisons are listed 
in Tables 6 through 8, respectively. As in Section IV.A the 
Value fcr a 1% significance level was taken for the most 


stringent case, i.e. case (11) was 


FO (1, 250) = 6.75 


The summary of F-statistics in Table 7 for $>50N 
shows strong predictive capability at all levels in both 
COEF-1 and COEF-2N, with the latter demonstrating slightly 
Higher Verification (F-value) statistics. The results of 
COEF-2S were not as strong when compared to COEF-1. Table 8 
Shows slightly better verifications with COEF-1 from level at 
through 11, then a reversal for the remaining eight levels. 
Note that at level 9, for the COEF-2S tabulation, the value 


R...=0, but also F<O. The R2,,-value, as defined by Eq. (22) 


EFF EFF 


takes on negative values whenever the mean square error of 
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the estimation scheme exceeds the sample variance. For this 
case, the sample-mean must be taken as the best prediction 
scheme, and RPP is Geset. to 0 ineoperlmiis aercalacst re 
value of Ropr: However the F-value was left as calculated 
by Eq. (23), with k=1 and N-1 degrees of freedom. The sta- 
tistical comparison of COEF-1 and COEF-2, listed in Table 9, 
for the entire OSV-2 independent sample was a composite of 
the results of Tables 7 and 8. Both estimation methods 
demonstrated strong prediction qualities, as evidenced by 
the large F-values. As in the regression analyses in Section 
IV.A, the comparison of the performance of COEF-1 and COEF-2, 
Table 9, is quite close over most of the 19 levels. Hence, 
there was no conclusive evidence supporting a decision to 
select a "best" set of coefficients from either COEF-1 or 
COEF-2. This decision was deferred to the outcome of the 
regression results relating observed to estimated values of 
precipitable water vapor. 
b. Precipitable Water 

Precipitable water calculations (U) were made on 
mierOsv=2sample soundings according to Eq: (7). Similar 
aooliecation of (7) to the 19-level OSV-2 mixing ratio speci-= 
fications generated by COEF-1 and the composite set of 
COEF-2 equations produced two modes of estimated U-values 
for each of the 829 soundings, depending upon the latitude 
of the sounding. 

The simple linear regression analyses were per- 


formed on each set of U, U using BMDO3R. Each regression, 
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ientical to the MLR withWenly ome independent variable, was 
performed with U as the dependent variable and the appro- 
priate U as the independent variable. These regressions 
provided significance measurements of the 19-level predic- 
tions by a Single vertically-scaled F-statistic with 1 and 
N-2 degrees of freedom. 

The statistical results from the precipitable 
water vapor analyses are listed in Table 10 and provided a 
final comparison between the COEF-1 and COEF-2 specification 


systems. The critical F-value, 


Eo = 7.38 
e 


corresponding to the sample of N=251 soundings (¢<50N) pro- 
vided the most stringent F-statistic significance require- 
ments namely at the 1% level. As seen in Table 10, all 
calculated F-values greatly exceeded the critical value, 
tis In@aveatang, to at least the 1% level of significance, 
that none of the specifications of precipitable water could 
have occurred by Bites 

The precipitable water results were analogous to 
those in Tables 7 through 9, and as seen in Table 10, the 
Single COEF-1 set of equations yielded the most effective 
specifications. Considering the conditions set down in Sec- 
Glen!’ bel requsarine “considerably better Specification by 
COEF-2 over COEF-1, and with the results of Sections IV.A 
ancielye Bye pie decision was madevat thils»point to eliminate 


ivometurther consideration the model of a latitudinally 
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diteiecrentiated set of coefficients. Thus the remainder of 
this research incorporated only the composite COEF-1 set of 
equations for comparison with the WC specification method. 
Table 10. Precipitable water vapor regression statistics of U on U by 


COEF-1 and by the appropriate COEF-2 scheme applied to indé- 
pendent OSV-2 sample. 


COE al COEF-2(N,S, Composite) 
STD STD STD 

AREA MEAN DEV ERR R F ERR R F 
$>50 sles a Us mee ee Oo OL ey Ore 72a CUS LO Ooo 
(N=578) 

o<50 5 674 O04) ee Oe Ome eae .3902 .8161 496.4 
(N=251) 

All 9 ee . 606 » 200 leo 0062855216 eo eo 7 4 Ae) cera 


N=829) 


Operational implementation of a single set of 
coefficients over a more complex set eliminated the require- 
ment of additional computer core allocation for storage of a 
second set of coefficients and the necessary logic state- 
mMemts to enaet the stratification differentgation. Clearly, 
@ae current results do not warrant use of the more complex 
COEF-2 prediction scheme. 

oe Comearison Waths the WC Predicrens 
a. Level-by-Level 

In this section comparison of estimates produced 

by the "best'' COEF-1 coefficient set was made with those es- 


timates from the WC predictions. Comparison was conducted 





in accordance with the precedures discussed in Section 
IV.B.1.a. As in previous comparisons, the most stringent 
conditions were adhered to for determination of an E-value, 


moe this case 


FA (1, 250) =O 


Statistical comparison of the two specification methods are 
ipsted in Tables 11, 12, and 13 for the latitudinally strati- 
fied subsamples (OSV-2N, OSV-25S) and for the total OSV-2 
sample, respectively. 

Without exception, the COEF-1 specifications 
were Statistically superior to those produced by the WC co- 
efficients at all 19 J-levels in each of the three tables. 
The WC specifications do demonstrate considerable predicta- 
bility in the lower levels of the atmosphere, however, the 
F-values indicate poor results in the highest two or three 
levels. Some possible explanations will be offered in Sec- 
tion V for the breakdown of the upper level specifications 
by the WC equations. It is clear from the results listed in 
Tables 11 through 13 that the COEF-1 specifications are su- 
perior to the WC predictions for the level-by-level analysis. 

bee Precipictable Water 

A final comparison of the WC and COEF-1 specifi- 
cation methods was made uSing precipitable water vapor cal- 
culations as in Section IV.B.1.b. Table 14 lists the 
Statistics resulting from the vertically-scaled evaluation 


of the two water vapor estimates for each of the OSV-2 
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Sttatitications. The statistics were generated from two 
Simple linear regressions by BMDO3R of U on each of the je 
estimates. The last line of Table 14 indicates a definite 
Superiority of the COEF-1 equations over the WC equations 
for the independent OSV-2 data sample. 


Table 14. Precipitable water vapor results by COEF-1 and modified WC 


coefficients applied to independent OSV-2 sample. 


STD ae, STD 
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(N=251) 
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C. ISLAND AND COASTAL (NESS) DATA 
1. Level-by-Level 

The 160 NESS island and coastal March soundings pro- 
vided a test sample that focussed, to a degree, on evaluation 
eembdewpecsciple significance of winter ef1ecrs.on W-proiile. 
iment oupresents the statistics obtained by the first er— 
forts at estimating the W-profiles by COEF-1 and WC coeffi- 
cients. In making these first W-specifications all 160 
soundings (with observed W-values up to at least 510 mb) 
were accepted as valid data. The results presented in Table 


iMeewere at best discouraging for both sets of coeificients, 
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The negative F-values, starting at level 13 for the COEF-1 
specifications and at level 9 for those by the WC method, 
indicated considerably greater effectiveness would be achieved 
by making predictions with the sample mean values of the mix- 
me ©ravios. 

It was noted that a number of the NESS soundings 
should not have been considered reliable after reviewing the 
actual W-profiles. Unreliable soundings were identified 
when the reported moisture profiles indicated RH<20% in the 
lower levels of the soundings and persisted with increasing 
ascent. It was felt that the low moisture reports were due 
mostly to a failure of the moisture element on the radio- 
sonde instrument. 

If a sounding had RH considerably higher than 20% at 
all levels, its moisture profile was considered reliable and 
the sounding was retained. In some cases the relative hu- 
mrdity decreased to values lower than 20% in the mid—tropo- 
sphere but then recovered to exceed 20% at higher levels. 
Such soundings were also retained in a revised test set. If 
biemhin persisted at less than 20% throughout most of the 
troposphere, including the upper regions where motor-boating 
is not infrequent, the sounding was considered unreliable 


amcrediscarded (see Table 16). 
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Table 16. Examples of three types of soundings categorized 
by RH profiles extending up to level 10 and con- 


Sidered for possible inclusion in the NESS sample. 


(% RH) 


Level Reliable Recovered Unreliable 
10 CP 71 15 
lial 67 63 16 
eZ 62 23 16 
13 45 11 17 
14 42 9 18 
15 43 11 Zi 
16 42 16 20 
17 37 31 18 
18 O77 29 67 
1S, 79 38 76 


After screening the NESS soundings uSing the method 
Meek ecelo wma wOoual of 124 soundines remained for testing. 
Table 17 lists the statistical results of the level-by-level 
W-specifications by the COEF-1 and WC equation sets. Since 
the number of soundings with observed W-values drop off 
Sharply above level 10, the critical F-value was arbitrarily 


taken for N=70 occurring at level 7 and 
F = 7.01. 
G 
ips: value oft i is considerably more stringent than neces- 


sary for specifications at levels below level J=7, but as 


liable dy demonstrates, this is a reasonable restriction. 
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Examination of the F-values of Table 17 indicated 
that the WC coefficients performed more effectively in the 
lower levels of the atmosphere. Above level 16, the COEF-1 
set of equations produced statistically superior estimates 
up to level 4, where both estimation procedures produce neg- 
ative ROPE and F-values. The small number of soundings con- 
homme data in the last four levels was considered responsible 
for these unusual statistical outcomes and it was impossible 
to draw any useful conclusion regarding the comparative 
upper-level Specificity by the two estimation systems. 

2. Precipitable Water Vapor 

Precipitable water vapor integrations were performed 
using Eq. (7) on each of the 124 NESS soundings, incorpo- 
meee all available J-levels for which mixing ratio values 
Ppelstod.. J<9. Additionally, U-values were calculated using 
only the corresponding levels for the associated U-integra- 
PLOnsS tp tO levedad . 

The statistical results, as produced by a BMDO3R 
Simple linear regression of U on each of the two U's, are 


displayed in Table 18. 


Table 18. Statistical results from BMDO3R simple linear 
regressions on precipitable water vapor calcula- 
tions obtained from the 124 NESS observed and 
COEF-1, WC-estimated mixing ratio-values. 








COEF-1 WC 
STD STD S BUR 
MEAN DEV ERR R rE ERR R is 
1.44 ag fa 27 oe LO GOOre .4850 .8990 ane) 
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The critical F-value 


Fi(1, 122) = 6.84 


was compared to the calculated F-values of Table 18 to eval- 
Were setae Significance of the vertically scaled mixing ratio 
specifications. It was readily apparent that the U-estimates 
both by the COEF-1 and WC systems demonstrated high statis- 
tical significance and that the COEF-1 estimates indicated 
somewhat higher predictability. 

Reviewing the results of Table 18 along with those of 
Table 17, it became obvious that the near-surface mixing 
ratio values dominate the precipitable water vapor integra- 
tion, and that the upper level breakdown of the WC specifi- 
cations, as evidenced by Table 17, was masked by this 
dominance. Note that the W-values of the dependent and in- 
dependent OSV data sets were determined by Smith's power law, 
Eq. (6), for levels above J=9. On the other hand, the W- 
values for the NESS data set were taken solely as indicated 
by the radiosonde instrument, as were the data used by WC in 
determining their regression coefficients. Application of 
the power law to the OSV data tended to smooth the W-profile 
at higher levels and yielded a degree of consistency in the 
OSV W-profiles in the highest 5 prediction levels (P<300 mb), 
giving small W-values over the OSV's at levels higher than 
J=5 as compared with those resulting from the NESS sounding 
samples (Table 17). It is felt that the high values of W in 
Ene NESS data could be attributed largely to inaccuracies in 


the radiosonde observational data at the higher levels. 
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V. CONCLUSIONS 


This thesis has reinforced the findings of Weinreb and 
Crosby (WC) in providing supporting evidence that mixing 
ratios can be described by a linear combination of a desig- 
nated 11l-element temperature T(P)-profile, which is equiva- 
lent to a profile of WoCP). Further, it has been shown that 
any dependence of the relationship between temperature and 
mixing ratio on latitude is insignificant, and that one 
Single set of specification equations can be employed over 
all latitudes. 

It was evident from the statistical results presented in 
section IV.B.2 that the performance of the coefficient set 
(COEF-1) was superior to that of the WC coefficient set over 
the Ocean Station Vessels. However, the causes of the im- 
proved specifications were not regarded as a result of a 
land-mass (coastal) or seasonal effect. Rather, inaccuracies 
inherent in radiosonde moisture data in the higher level 
atmospheric structure may have affected the definition of 
the WC coefficients, which were derived by moisture data of 
accuracy comparable to that of the NESS data. This reason- 
ing is supported by the independent tests of the COEF-1 and 
WC specifications on the NESS data. 

Efforts to produce an accurate prediction scheme by MLR 
techniques, as demonstrated by this work, appear extremely 
promising. Effective prediction procedures seem not to be 


associated with seasonal, latitudinal or land-mass (coastal) 
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effects. Rather, effective prediction appears to depend 

upon an accurate data base of sufficiently good quality from 
which to define accurate coefficient sets over the depth 
1000-200 mb. The results of this research indicate that the 
Gav data provide such a data base, and that mixing ratio es- 
timates by a set of equations of the form derived here pro- 
vide a representative profile of W(P) up to approximately 

200 mb. The high level of predictability of the precipitable 
water vapor resulting from the vertical integration of mixing 
ratio estimates provided substantial evidence supporting the 
conclusion that the OSV data define highly effective regres- 


sion equations. 





APPENDIX A 


This Appendix contains a listing of the three sets of 
regression coefficients COEF-1, COEF-2N, and COEF-2S deter- 
mined by BMDOSR from the OSV-1 Ship sample. A fourth list- 
ing contains the selection of coefficients extracted from 
the WC set described in Section I. 

Each of the four sets of coefficients was applied accord- 
ing to the following matrix operation to produce the appro- 


priate mixing ratio estimates at level Jd: 
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APPENDIX B 


Appendix B lists the level-by-level statistical results 
obtained by application of coefficient sets COEF-1, COEF-2N, 
and COEF-2S to the individual OSV-2 soundings. The listings 
are arranged in the order specified by Table 3, only that 


Ship Station P is not included here (see Table 6). 
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